Semantic Type Classification of Common Words in Biomedical Noun Phrases

نویسندگان

  • Amy Siu
  • Gerhard Weikum
چکیده

Complex noun phrases are pervasive in biomedical texts, but are largely underexplored in entity discovery and information extraction. Such expressions often contain a mix of highly specific names (diseases, drugs, etc.) and common words such as “condition”, “degree”, “process”, etc. These words can have different semantic types depending on their context in noun phrases. In this paper, we address the task of classifying these common words onto fine-grained semantic types: for instance, “condition” can be typed as “symptom and finding” or “configuration and setting”. For information extraction tasks, it is crucial to consider common nouns only when they really carry biomedical meaning; hence the classifier must also detect the negative case when nouns are merely used in a generic, uninformative sense. Our solution harnesses a small number of labeled seeds and employs label propagation, a semisupervised learning method on graphs. Experiments on 50 frequent nouns show that our method computes semantic labels with a microaveraged accuracy of 91.34%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Structural Analysis of the Descriptive and Additional Compounds in Ghazaliat-e Shams

Linguistic combinations are referred to as a chain of words that associate with one another and form a semantic phrases, such as adjectival and noun. Innovation in this field is one of the main tasks of creative poetry. Shams's sonnets are of the prominent works of Persian literature and their composition is characterized by its stylistic features. In order to express his sublime and mystical t...

متن کامل

Pre-Noun Modifiers as a Means to Describe Product Characteristics

The article deals with the characteristics of noun phrases functioning in pre-position. Pre-noun modifiers, or attributive modifiers, are among the high-capacity structures in the contemporary English language, as they enable to devoid of prepositions and to describe objects in a laconic way. The goal of this research was to analyze the way scholars interpret this linguistic structure today, as...

متن کامل

Extracting Semantic Orientations of Phrases from Dictionary

We propose a method for extracting semantic orientations of phrases (pairs of an adjective and a noun): positive, negative, or neutral. Given an adjective, the semantic orientation classification of phrases can be reduced to the classification of words. We construct a lexical network by connecting similar/related words. In the network, each node has one of the three orientation values and the n...

متن کامل

Automatic Classification of Previously Unseen Proper Noun Phrases into Semantic Categories Using an N-Gram Letter Model

We investigate the ability to automatically classify previously unseen proper noun phrases as drug names, company names, movie titles, and place names. The classifier relies on learning probabilistic features of each of the categories, the most important being a linearly interpolated 4-gram letter model of the proper nouns. In addition, prior probabilities, suffix probabilities, phrase length, ...

متن کامل

Context effects of pictures and words in naming objects, reading words, and generating simple phrases.

In five language production experiments it was examined which aspects of words are activated in memory by context pictures and words. Context pictures yielded Stroop-like and semantic effects on response times when participants generated gender-marked noun phrases in response to written words (Experiment 1A). However, pictures yielded no such effects when participants simply read aloud the noun...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015